Scale-Free Networks are Ultrasmall 
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We study the diameter, or the mean distance between sites, in a scale-free network, having N 
sites and degree distribution p(k) oc fc _A , i.e. the probability of having k links outgoing from a site. 
In contrast to the diameter of regular random networks or small world networks which is known to 
be d ~ IniV, we show, using analytical arguments, that scale free networks with 2 < A < 3 have a 
much smaller diameter, behaving as d ~ In IniV. For A = 3, our analysis yields d ~ In N/ In In N, as 
obtained by Bollobas and Riordan, while for A > 3, d ~ IniV. We also show that, for any A > 2, 
one can construct a deterministic scale free network with d ~ In IniV, and this construction yields 
the lowest possible diameter. 
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It is well known [1-4] that random networks, such as 
Erdos-Renyi networks [5,6] as well as partially random 
networks, such as small- world networks [7], have a very 
small average distance (or diameter) between sites, which 
scales as d ~ IniV, where N is the number of sites. 
Since the diameter is small even for large N , it is com- 
mon to refer to such networks as "small world" networks. 
Many natural and man-made networks have been shown 
to posses a scale free degree distribution, including the 
Internet [8], WWW [3,9], metabolic [10] and cellular net- 
works [11] and also trust cooperation networks [12]. 

The question of the diameter of such networks is fun- 
damental in the study of networks. It is relevant in many 
fields regarding communication and computer networks, 
such as routing [13], searching [14] and transport of in- 
formation [13]. All those processes become more efficient 
when the diameter is smaller. It also might be relevant to 
subjects such as the efficiency of chemical and biochem- 
ical processes and spreading of viruses, rumors, etc. in 
cellular, social and computer networks. In physics, the 
scaling of the diameter with the network size is related to 
the physical concept of the dimensionality of the system, 
and is highly relevant to phenomena such as diffusion, 
conduction and transport. The anomalous scaling of the 
diameter in those networks is expected to lead to anoma- 
lies in diffusion and transport phenomena on those net- 
works. In this Letter we study the diameter of scale-free 
random networks and show that it is significantly smaller 
than the diameter of regular random networks. We find 
that scale free networks with 2 < A < 3 have diameter 
d ~ In In N and thus can be considered as "ultra small 
world" networks. 

We define the diameter of a graph as the average dis- 
tance between any two sites on the graph (unlike the 
usual mathematical definition of the largest distance be- 
tween two sites). Since no embedding space is defined for 
those networks the distance denotes the shortest path be- 



tween two sites {i.e. the smallest number of followed links 
needed to reach one from the other). If the network is 
fragmented we will only be interested in the diameter of 
the largest cluster (assuming there is one). 

To estimate the diameter we will study the radius of 
such graphs. We define the radius of a graph as the av- 
erage distance of all sites on the graph from the site with 
the highest degree in the network (if there is more than 
one we will arbitrarily choose one of them). The diameter 
of the graph, d, is restricted to: 



r < d < 2r, 



(1) 



where r is the radius of the graph, defined as the average 
distance (I) between the highest degree site (the origin) 
and all other sites. 

A scale free graph is a graph having degree distribu- 
tion, i.e. the probability that a site has k connections: 



p{k) = ck 



m,m 4- 1, K, 



(2) 



where c w (A — l)m is a normalization factor, and 
m and K are the lower and upper cutoffs of the distri- 
bution, respectively. The ensemble of such graphs has 
been defined in [15]. However, we will refer here to the 
ensemble of scale free graphs with the "natural" cutoff 
K = miV 1 /^- 1 ) [16-18]. 

We begin by showing that the lower bound on the di- 
ameter of any scale-free graph with A > 2 is of the order 
of In In N, then we show that for random scale free graphs 
with 2 < A < 3 the diameter actually scales as In IniV. It 
is easy to see that the lowest diameter for a graph with 
a given degree distribution is achieved by the following 
construction: Start with the highest degree site, and then 
in each layer attach the next highest degree sites until the 
layer is full. By construction, loops will occur only in the 
last layer. This structure is somewhat similar to a graph 
with assortative mixing [19] - since high degree sites tend 
to connect to other high degree sites. 
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In this kind of graph the number of links outgoing 
from the Ith layer (sites at distance I from the origin), 
Xi, equals the total number of sites with degree between 
Ki, which is the highest degree of a site not reached in 
the Ith layer, and Ki +i , which is the same for the I + 1 
layer. See Fig. 1. This can be described by the following 
equation: 

Xi =N p{k)dk « m^NK}-?. (3) 
Jk 1+1 

The number of links outgoing from the I + 1 layer equals 
the total number of links in all the sites between Ki and 
Ki + i minus one link at every site, which is used to con- 
nect to the previous layer: 

Xl+i = N (k-l)p(k)dk « ^Zlm^NKf-^. (4) 
Jk 1+1 a - z 

Solving those recursion relations with the initial condi- 
tions K = ATV(A-i) and Xo = R() lcadg tQ . 

Xj =o( A - 1 )( 1 -" , )JV 1 -« ,+1 , (5) 
where a = (A - 1)/(A - 2)m, u = (A - 2)/(A - 1), and 
Ki=m(xi/N)T^ . (6) 

To bound the radius, r, of the graph, we will as- 
sume that the low degree sites are connected randomly 
to the center. We choose some degree 1 <C k* w 
(lnlnAr) 1 /(^-i). We can use Eq. (6) to show that if 
h w lnlnTV/ln(A - 2) then K h < k* , so all sites with 
degrees k > k* would have been reached with probabil- 
ity 1 in the first I layers. On the other hand, if we start 
uncovering the graph from any site - provided it belongs 
to the giant component - then at a distance li from this 
site there are at least li bonds. The probability that at 
none of those bonds will lead to a site of degree k* decays 
as (1 - k*p(k*)/(k)) h . So, taking fc*^ 1 < l 2 < In In TV, 
we will definitely reach a site of degree at least k* at 
distance l 2 from almost every site. Since I = h + h, all 
those sites are at a distance of order In In TV from the 
highest degree site, this is the behavior of the radius of 
the graph. Thus, In In TV is a lower bound for the diame- 
ter of scale free networks, and by applying this approach 
one can generate scale free networks with this diameter, 
for A > 2. For A = 2 the construction is somewhat similar 
to the condensate obtained in [20]. 

In the following, we present analytical arguments 
showing that the behavior of d <~ In In TV is actually 
achieved in random non-correlated scale free graphs with 
2 < A < 3. Non-correlated networks are networks in 
which the degree of a site reached by following a link is 
independent of the degree of the site at the other side 
of that link, One can view the process of uncovering the 
network (which is the same as building it) by following 
the links one at a time. For simplicity let us start with 



the site with the highest degree (which is also guaranteed 
to belong to the giant component), whose degree is pro- 
portional to TV 1 /^- 1 ) [16,17]. Next we expose the layers, 
I = 1, 2, 3, . . ., one at a time. To this end we consider the 
graph as built from one large developing cluster, and sites 
which have not yet been reached (they can also belong 
to the giant component or not). A similar consideration 
has been used by Molloy and Reed [21]. 

After layer I is explored the distribution of the un- 
reached sites changes (since most high degree sites are 
reached in the first layers). To take account for this we 
assume that the Ith layer has xi outgoing links. The dis- 
tribution of degrees after uncovering some of the edges 
changes to P'(k) w P{k)exp{-k/K{) [21]. In the limit of 
large TV and large Ki we will assume that after exploring 
this layer the highest degree of the unvisited sites is of 
order Ki, where xi an d Kl are functions of I that will be 
determined later. 

Let us now consider the I + 1 layer. There is a new 
threshold function, that is, the new distribution of un- 
visited sites is like a step function - almost P(k) for 
k < Ki + i and almost for k > Ki + \. The reason is 
as follows. A site with degree k has a probability of 
p = k/(N(k)) to be reached by following a link [22]. If 
there are xi outgoing links then if pxi > 1 we can assume 
that (in the limit TV — > 00) the site will be reached in the 
next level with probability 1. Therefore, all unvisited 
sites with degree k > N(k)/xi will be surely reached in 
the next layer. On the other hand, almost all the unvis- 
ited sites with degree k < N{k) jxi will remain unvisited 
in the next layer - therefore, their distribution will re- 
main almost unchanged. From those considerations the 
highest degree of the unexplored sites in the I + 1 layer 
is determined by: 

K l+1 « TV(fc)/ Xi . (7) 

In the Ith layer the number of loops, i.e. the number of 
links connecting two sites of the Ith layer and the number 
of sites in the I + 1 layer connected to more than one site 
in the Ith layer is proportional to xf/((k)N). Since as 
long as xi i s n °t °f order TV, this fraction is smaller in 
order than Xh we can safely assume that loops can be 
neglected until the last shells have been reached. Similar 
arguments have been used in [16]. 

In the I + 1 layer, all sites with degree k > N(k)/xi 
will be exposed. Since the probability of reaching a site 
via a link is proportional to kP(k), the average degree of 
sites reached by following a link is k = (k 2 )/(k) [16]. For 
scale-free graphs k can be approximated by [16] 

/A-2A /K 3 - x -m 3 - x \ 
K= {—3){ K^- m ^ )- (8) 

This will be the average degree for sites reached in this 
layer, whose degree is k < N(k)/xi- Therefore, k should 
be calculated using the new cutoff (7), from (8) follows 



2 




FIG. 1. Illustration of the exposure process. The large cir- 
cle denotes the exposed fraction of the giant component, while 
the small circles denote individual sites. The sites on the right 
have not been reached yet. (a) The structure after the expo- 
sure of the Zth layer, (b) The structure after the exposure of 
the I + 1 layer. 

Using the above considerations, the number of outgo- 
ing links from the I + 1 layer can be calculated. To this 
end we consider the total degree of all sites reached in 
the I + 1 level. This includes all sites with degree k, 
Ki + i < k < Ki, as well as other sites with average de- 
gree proportional to k — 1 links (the —1 is due to one 
link going inwards). Thus, the value of k is calculated 
using the cutoff Ki +1 . Loops within a layer and multiple 
links connecting the same site in the I + 1 layer can be 
neglected since as long as the number of sites in the layer 
are of order less than TV, they are negligible in the limit 
TV — > oo. The two contributions can be written as the 
sum of two terms: 

Xi+i = N (k - l)p(k)dk + xi HK l+1 ) - 1]. (9) 

JK l+1 



Noting that p(k) oc k~ x and that k oc K 3 ~ x [16] it fol- 
lows that xi+i « NKf~ x (where both terms in Eq. (9) 
contribute the same order). This can be written as a 
second recurrence equation: 



Xi+i = ANKf- 



(10) 



,A-2 /(3 _ A) = ^Apl)m 



where A = (fc)ra" , v „ , v - (A _ 2)(3 _ A) - 

Solving the equations (7) and (10) yields the result, 



Xi 



(A-2)'-l (A-2)' + 1 

A ~ TV 



(11) 



where xi is t ne number of outgoing links from the Zth 
layer. Eq. (7) then leads to: 



Ki 



A-2)' 



t (A-2)' 



Using the same considerations that follow Eq. (6), one 
can deduce that here also 

d«mlnTV/ln(A-2). (13) 

Our result, Eq. (13), is consistent with the observations 
that the distance in the Internet network is extremely 
small and that the distance in metabolic scale free net- 
works is almost independent of TV [10]. These results can 
be explained by the fact that In In TV is almost a constant 
over many orders of magnitude. Our arguments show, 
however, that for a fixed distribution and very large val- 
ues of TV no scale-free graph with A > 2 can have a con- 
stant diameter. However, for A = 2, since the highest 
degree site has order TV links, we expect that for this 
case d w const. 

For A > 3 and TV ^> 1, k is independent of TV, and since 
the second term of Eq. (9) is dominant, Eq. (9) reduces 
to xi+i = l)Xi> where n is a constant depending only 
on A. This leads to the known result xi ~ C(N, A)(/c — 1)' 
and the radius of the network Z oc In TV [23] . 

For A = 3, Eq. (9) reduces to Xi+i = Xi m Xz- Tak- 
ing the logarithm of this equation one obtains lnx;+i — 
ln%j = lnlnx;. Defining g(l) = hixi and approximat- 
ing the difference equation by the differential equation 
g' = Ing. This equation can not be solved exactly. How- 
ever, taking u = Ing the equation reduces to 



In In AT 
In In s/N 



1= I e u ~ lnu du. 



(14) 



The lower bound is obtained from the highest degree site 
for A = 3, having degree K = toa/TV- Thus, xo = toa/TV- 
The upper bound is the result of searching I for which 
Xi <~ TV with lower order corrections. The integral in 
Eq. (14) can be approximated by the steepest descent 
method, leading to 



I w In TV/ (In In TV), 



(15) 



(12) 



assuming In In TV 3> 1 . 

The above result, Eq. (15), has been obtained rigor- 
ously for the maximum distance in the Barabasi- Albert 
(BA) model [24], having A = 3 (for m > 2) [25]. Al- 
though the result in [25] is for the largest distance be- 
tween two sites, their derivation makes it clear that the 
average distance will also behave similarly. For to = 1 in 
the Barabasi- Albert model the graph turn into trees and 
the behavior of d <~ In TV is obtained [25,26]. It should 
be noted that for to = 1 the giant component in the ran- 
dom model contains only a fraction of the sites (while for 
to > 2 it contains all the sites — at least in the lead- 
ing order). The BA model, on the other hand, is fully 
connected for every to. This might explain why exact 
trees and BA trees are different from generalized random 
graphs. 
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Our derivation is valid for uncorrelated networks. For 
assortative networks [19] the diameter is expected to be 
even smaller, as mentioned earlier. For disassortative 
networks we would expect the odd layers to hold high de- 
gree nodes and the even layers to hold low degree nodes, 
so it is plausible that the scaling of the diameter is the 
same, with some possible constant factor < 2. Note that 
this argument may not be valid for disassortative net- 
works with m = 1, where many dead-ends exist. 

In summary, we have shown that scale free graphs have 
diameter d ~ In In N, which is smaller than the d <~ In N 
behavior, expected for regular random graphs. For every 
A > 2 scale free graphs can be built to have a diameter 
of order d ~ In In N. If random scale free graphs are con- 
sidered, only for 2 < A < 3, the behavior d ~ lnlniV is 
obtained, while for A > 3 the usual result d ~ In TV is 
recovered. 

Note: After this manuscript [27] has been submitted, 
two other manuscript have been submitted that confirm 
our results [28,29]. 
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